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DETAILED ACTION 



Information Disclosure Statement 



1. The references listed in the Information Disclosure Statement submitted on 10/27/2000 
have been considered by the examiner (see attached PTO-1449). 



2. Receipt is acknowledged of papers submitted under 35 U.S. C. 1 19(a)-(d), which papers 
have been placed of record in the file. 



The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth 
in section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

3. Claims 1, 13 and 25 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Miyazawa et al. (US 5,946,658) hereinafter referenced as Miyazawa, in view of Wang et al. (US 
6,505,162 Bl) hereinafter referenced as Wang. 

As per claim 1, Miyazawa discloses cartridge-based, interactive speech recognition 
method with a response creation capability (title), comprising: 



Priority 



Claim Rejections - 35 USC § 103 
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a voice input section inputting voice uttered by a user (column 6, lines 30-49 and Fig. 1, 
'input unit 1, which inputs the speaker's voice'); 

a voice recognition processing section recognizing the voice inputted by the voice input 
section, and converting the recognized voice (speech) into an input symbol string (column 6, 
lines 30-49 and Fig. 1, 'to recognize words spoken', 'speech recognition (converting the 
recognized voice) and dialogue management unit 4', 'recognizable words (symbol string) 5 ); 

a conversation pattern processing section inputting said input symbol string from the 
voice recognition processing section, and outputting an output symbol string corresponding to 
said input symbol string based on a conversation pattern described in advance, (column 4, lines 
1-17 and Fig. 1, 'conversation content storage for retaining canned context rules (broadly 
interpreted as conversation pattern(s) described in advance) and response procedures when 
recognized words or phrases (symbol string) are encountered (input)', response data storage for 
retaining response voice vector data used in formulating an appropriate response to perceived 
and recognized words and corresponding context and action rules'; Fig. 1 and column 6, lines 
38-42, 'speech recognition and dialogue management unit 4', 'response (output) data memory 
unit 4, which stores pre-set (described in advance) response data (necessarily including symbol 
string, since it will output to a voice synthesis unit 6)'); 

a voice synthesis processing section converting said output symbol string from the 
conversation pattern processing section into voice (Fig. 1 and column 6, lines 38-42, 'voice 
synthesis unit 6'); 

a voice output section outputting and uttering the voice from the voice synthesis 
processing section (Fig. 1 and column 6, lines 38-42, 'voice output unit 7'); and 
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Even though Miyazawa discloses changing said output symbol string in accordance with 
the grasped conversation characteristics (column 5, lines 12-14, 'conversation and response 
information can be changed or updated as the general culture (broadly interpreted grasped 
conversation characteristics) changes'; column 12, lines 20-22, 'when a recognition word is 
determined, speech recognition and dialogue manager 4 comprehends (grasp) the meaning of the 
input voice, determine a response ... detects fluctuation data'), Miyazawa does not expressly 
disclose "a conversation characteristic control section grasping conversation characteristics of 
said user based on said input symbol string from said conversation pattern processing section". 
However, this feature is well known in the art as evidenced by Wang who discloses apparatus 
and method of r portable dialogue management using a hierarchical task description table (title), 
comprising a conversational dialogue manager that has a standard control mechanism (a 
conversation characteristic control section), selects (grasp) appropriate dialogue states (broadly 
interpreted as conversation characteristics of the user), executes the response actions, extracts 
(grasp) the domain-dependent factors (conversation characteristics of the user), controls the 
dialogue flow according to semantic input (necessarily including recognized and comprehended 
meaning of the input voice stated above) of a user and the instruction provided by the external 
knowledge base to generate semantic output in response, replaces the knowledge base (column 5, 
lines 43-64), and update the dialogue flow (column 6, line 3). Therefore, it would have been 
obvious to one of ordinary skill in the art at time the invention was made to modify Miyazawa by 
specifically providing a control mechanism for dialogue (conversation) management, as taught 
by Wang, for the purpose of increasing flexibility for a dialogue management. (Wang: column 4, 
lines 5-7). 
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As per claim 13, it recites a method. The rejection is based on the same reason as 
described for claim 1, because the claim recites same or similar limitation(s) as claim 1. 

As per claim 25, it recites a computer readable recording medium. The rejection is based 
on the same reason as described for claim 1, because the claim recites same or similar 
limitation(s) as claim 1 . 

4. Claims 3 and 15 are rejected under 35 U.S. C. 103(a) as being unpatentable over 
Miyazawa et al. (US 5,946,658) hereinafter referenced as Miyazawa, in view of Marx et al. (US 
6,173,266 Bl) hereinafter referenced as Marx. 

As per claim 3, Miyazawa discloses cartridge-based, interactive speech recognition 
method with a response creation capability (title), comprising: 

a voice input section inputting voice uttered by a user (column 6, lines 30-49 and Fig. 1, 
'input unit 1, which inputs the speaker's voice'); 

a voice recognition processing section recognizing the voice inputted by the voice input 
section, and converting the recognized voice (speech) into an input symbol string (column 6, 
lines 30-49 and Fig. 1, 'to recognize words spoken', 'speech recognition (converting the 
recognized voice) and dialogue management unit 4', 'recognizable words (symbol string)'); 

a conversation pattern processing section inputting said input symbol string from the 
voice recognition processing section, and outputting an output symbol string corresponding to 
said input symbol string based on a conversation pattern described in advance, (column 4, lines 
1-17 and Fig. 1, 'conversation content storage for retaining canned context rules (broadly 
interpreted as conversation pattern(s) described in advance) and response procedures when 
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recognized words or phrases (symbol string) are encountered (input) 5 , response data storage for 
retaining response voice vector data used in formulating an appropriate response to perceived 
and recognized words and corresponding context and action rules'; Fig. 1 and column 6, lines 
38-42, 'speech recognition and dialogue management unit 4', 'response (output) data memory 
unit 4, which stores pre-set (described in advance) response data (necessarily including symbol 
string, since it will output to a voice synthesis unit 6)'); 

a voice synthesis processing section converting said output symbol string from the 
conversation pattern processing section into voice (Fig. 1 and column 6, lines 38-42, 'voice 
synthesis unit 6'); 

a voice output section outputting and uttering the voice from the voice synthesis 
processing section (Fig. 1 and column 6, lines 38-42, 'voice output unit 7'). 

Even though Miyazawa discloses a storage section storing words relevant to words 
uttered by the user (column 18, lines 56-67, 'conversation context memory 101 stores the 
information related to the kind of phrases which have been registered and the kind of responses 
that should be issued to each of theses registered phrase'), Miyazawa does not expressly disclose 
storing the words being "synonyms", and "an association function section extracting the 
synonyms and relevant words synonymous with and relevant to words uttered by said user from 
said storage section based on said input symbol string from said conversation pattern processing 
section, and outputting said extracted synonyms and relevant words to said conversation pattern 
processing section". However, this feature is well known in the art as evidenced by Marx who 
discloses system and method for developing interactive speech applications (title), comprising 
dialogue modules (abstract), processing user response by using synonyms (column 9, lines 30- 
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39), editing items and synonyms of the vocabulary using a predefined system dictionary, listing 
synonyms under their items at an indented tier, and adding, deleting, highlighting (broadly 
interpreted as association function of extracting) synonyms of an recognized item (column 19, 
lines 16-33, and Figs. 12 and 13), which corresponds to the claimed limitation. Therefore, it 
would have been obvious to one of ordinary skill in the art at time the invention was made to 
modify Miyazawa by specifically providing storing and extracting synonyms and relevant words 
for the recognized words, as taught by Marx, for the purpose of offering alternative terms of the 
recognized items for a dialogue module (Marx: column 19, lines 10-13). 

As per claim 15, it recites a method. The rejection is based on the same reason as 
described for claim 3, because the claim recites same or similar limitation(s) as claim 3. 

5. Claims 4-5 and 16-17 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Miyazawa et al. (US 5,946,658) hereinafter referenced as Miyazawa, in view of Petkovic (US 
6,185,527 Bl) hereinafter referenced as Petkovic. 

As per claim 4, Miyazawa discloses cartridge-based, interactive speech recognition 
method with a response creation capability (title), comprising: 

a voice input section inputting voice uttered by a user (column 6, lines 30-49 and Fig. 1, 
'input unit 1, which inputs the speaker's voice'); 

a voice recognition processing section recognizing the voice inputted by the voice input 
section, and converting the recognized voice (speech) into an input symbol string (column 6, 
lines 30-49 and Fig. 1, 'to recognize words spoken', 'speech recognition (converting the 
recognized voice) and dialogue management unit 4', 'recognizable words (symbol string)'); 



Application/Control Number: 09/697, 1 46 Page 8 

Art Unit: 2654 

a conversation pattern processing section inputting said input symbol string from the 
voice recognition processing section, and outputting an output symbol string corresponding to 
said input symbol string based on a conversation pattern described in advance, (column 4, lines 
1-17 and Fig. 1, 'conversation content storage for retaining canned context rules (broadly 
interpreted as conversation pattern(s) described in advance) and response procedures when 
recognized words or phrases (symbol string) are encountered (input)', response data storage for 
retaining response voice vector data used in formulating an appropriate response to perceived 
and recognized words and corresponding context and action rules'; Fig. 1 and column 6, lines 
38-42, 'speech recognition and dialogue management unit 4', 'response (output) data memory 
unit 4, which stores pre-set (described in advance) response data (necessarily including symbol 
string, since it will output to a voice synthesis unit 6)'); 

a voice synthesis processing section converting said output symbol string from the 
conversation pattern processing section into voice (Fig. 1 and column 6, lines 38-42, 'voice 
synthesis unit 6'); 

a voice output section outputting and uttering the voice from the voice synthesis 
processing section (Fig. 1 and column 6, lines 38-42, 'voice output unit 7'). 

Even though Miyazawa discloses a storage section storing words relevant to words 
uttered by the user (column 18, lines 56-67, 'conversation context memory 101 stores the 
information related to the kind of phrases which have been registered and the kind of responses 
that should be issued to each of theses registered phrase'), Miyazawa does not expressly disclose 
"a second recognition processing section recognizing broadcast voice from a broadcast station, 
and converting the recognized broadcast voice into second input symbol strings" and "an event 
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data storage section storing said second input symbol strings as event data/' However, this 
feature is well known in the art as evidenced by Petkovic who discloses system and method for 
automatic audio content analysis for word spotting, indexing, classification and retrieval (title), 
comprising audio source, such as a broadcast network, or radio station, speech recognition 
engine (necessarily converting speech (voice) into text (symbol strings) (column 6, lines 48-67), 
and a data storage device having computer usable code means for classifying and indexing 
(herein inherently include storing) audio signals representing audio events, including using a 
speech recognition engine to render words (symbol strings) and alternatives to some words 
(column 17, lines 23-56). Therefore, it would have been obvious to one of ordinary skill in the 
art at time the invention was made to modify Miyazawa by specifically providing a speech 
recognition mechanism for broadcast audio source and a data storage for storing event data 
related to the audio processing, including speech recognition, as taught by Petkovic, for the 
purpose of recognizing many audio applications benefiting from a more generalized ability, 
based on predetermined audio events (Petkovic: column 2, line 67 to column 3, line 4). 

As per claim 5 (depending on claim 4), as stated above, Petkovic discloses using 
broadcast audio source (column 6, lines 48-67), using a data storage device and a speech 
recognition engine to render words (symbol strings) and alternatives to some words from 
(column 17, lines 23-56), which suggests the combined system is capable of implementing 
functionality as claimed "extracting only a symbol string desired by the user among said second 
input symbol strings converted by said second voice recognition section, and storing the 
extracted symbol string in said event data storage section". Therefore, it would have been 
obvious to one of ordinary skill in the art at time the invention was made to modify Miyazawa by 
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specifically providing a mechanism for extracting only words among from broadcast audio 
source and storing them, as taught by Petkovic, for the purpose of recognizing many audio 
applications benefiting from a more generalized ability, based on predetermined audio events 
(Petkovic: column 2, line 67 to column 3, line 4). 

As per claim 16, it recites a method. The rejection is based on the same reason as 
described for claim 4, because the claim recites same or similar limitation(s) as claim 4. 

As per claim 17 (depending on claim 16), the rejection is based on the same reason as 
described for claim 5, because the claim recites same or similar limitation(s) as claim 5. 

6. Claims 6 and 18 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Miyazawa in view of Petkovic as applied to claim 4 above, and further in view of Wang. 

As per claim 6 (depending on claim 4), even though Miyazawa discloses changing said 
output symbol string in accordance with the grasped conversation characteristics (column 5, lines 
12-14, Conversation and response information can be changed or updated as the general culture 
(broadly interpreted grasped conversation characteristics) changes'; column 12, lines 20-22, 
£ when a recognition word is determined, speech recognition and dialogue manager 4 
comprehends (grasp) the meaning of the input voice, determine a response . . . detects fluctuation 
data'), Miyazawa in view of Petkovic does not expressly disclose "a conversation characteristic 
control section grasping conversation characteristics of said user based on said input symbol 
string from said conversation pattern processing section". However, this feature is well known 
in the art as evidenced by Wang who discloses apparatus and method of r portable dialogue 
management using a hierarchical task description table (title), comprising a conversational 
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dialogue manager that has a standard control mechanism (a conversation characteristic control 
section), selects (grasp) appropriate dialogue states (broadly interpreted as conversation 
characteristics of the user), executes the response actions, extracts (grasp) the domain-dependent 
factors (conversation characteristics of the user), controls the dialogue flow according to 
semantic input (necessarily including recognized and comprehended meaning of the input voice 
stated above) of a user and the instruction provided by the external knowledge base to generate 
semantic output in response, replaces the knowledge base (column 5, lines 43-64), and update 
the dialogue flow (column 6, line 3). Therefore, it would have been obvious to one of ordinary 
skill in the art at time the invention was made to modify Miyazawa in view of Petkovic by 
specifically providing a control mechanism for dialogue (conversation) management, as taught 
by Wang, for the purpose of increasing flexibility for a dialogue management. (Wang: column 4, 
lines 5-7). 

As per claim 18 (depending on claim 16), the rejection is based on the same reason as 
described for claim 6, because the claim recites same or similar limitation(s) as claim 6. 

7. Claims 7 and 19 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Miyazawa in view of Petkovic as applied to claim 4 above, and further in view of Marx. 

As per claim 7 (depending on claim 4), even though Miyazawa discloses a storage 
section storing words relevant to words uttered by the user (column 18, lines 56-67, 
'conversation context memory 101 stores the information related to the kind of phrases which 
have been registered and the kind of responses that should be issued to each of theses registered 
phrase'), Miyazawa in view of Petkovic does not expressly disclose storing the words being 
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"synonyms", and "an association function section extracting the synonyms and relevant words 
synonymous with and relevant to words uttered by said user from said storage section based on 
said input symbol string from said conversation pattern processing section, and outputting said 
extracted synonyms and relevant words to said conversation pattern processing section". 
However, this feature is well known in the art as evidenced by Marx who discloses system and 
method for developing interactive speech applications (title), comprising dialogue modules 
(abstract), processing user response by using synonyms (column 9, lines 30-39), editing items 
and synonyms of the vocabulary using a predefined system dictionary, listing synonyms under 
their items at an indented tier, and adding, deleting, highlighting (broadly interpreted as 
association function of extracting) synonyms of an recognized item (column 19, lines 16-33, and 
Figs. 12 and 13), which corresponds to the claimed limitation. Therefore, it would have been 
obvious to one of ordinary skill in the art at time the invention was made to modify Miyazawa in 
view of Petkovic by specifically providing storing and extracting synonyms and relevant words 
for the recognized words, as taught by Marx, for the purpose of offering alternative terms of the 
recognized items for a dialogue module (Marx: column 19, lines 10-13). 

As per claim 19 (depending on claim 16), the rejection is based on the same reason as 
described for claim 7, because the claim recites same or similar limitation(s) as claim 7. 

8. Claims 8, 12, 20 and 24 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Miyazawa in view of Petkovic, and further in view of Kanevsky et al. (US 6,236,968 Bl) 
hereinafter referenced as Kanevsky. 
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As per claim 8, Miyazawa discloses cartridge-based, interactive speech recognition 
method with a response creation capability (title), comprising: 

a voice input section inputting voice uttered by a user (column 6, lines 30-49 and Fig. 1, 
'input unit 1, which inputs the speaker's voice'); 

a voice recognition processing section recognizing the voice inputted by the voice input 
section, and converting the recognized voice (speech) into an input symbol string (column 6, 
lines 30-49 and Fig. 1, 'to recognize words spoken 5 , 'speech recognition (converting the 
recognized voice) and dialogue management unit 4', 'recognizable words (symbol string)'); 

a conversation pattern processing section inputting said input symbol string from the 
voice recognition processing section, and outputting an output symbol string corresponding to 
said input symbol string based on a conversation pattern described in advance, (column 4, lines 
1-17 and Fig. 1, 'conversation content storage for retaining canned context rules (broadly 
interpreted as conversation pattern(s) described in advance) and response procedures when 
recognized words or phrases (symbol string) are encountered (input)', response data storage for 
retaining response voice vector data used in formulating an appropriate response to perceived 
and recognized words and corresponding context and action rules'; Fig. 1 and column 6, lines 
38-42, 'speech recognition and dialogue management unit 4', 'response (output) data memory 
unit 4, which stores pre-set (described in advance) response data (necessarily including symbol 
string, since it will output to a voice synthesis unit 6)'); 

a voice synthesis processing section converting said output symbol string from the 
conversation pattern processing section into voice (Fig. 1 and column 6, lines 38-42, 'voice 
synthesis unit 6'); 
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a voice output section outputting and uttering the voice from the voice synthesis 
processing section (Fig. 1 and column 6, lines 38-42, 'voice output unit 7'), 

Even though Miyazawa discloses a storage section storing words relevant to words 
uttered by the user (column 18, lines 56-67, 'conversation context memory 101 stores the 
information related to the kind of phrases which have been registered and the kind of responses 
that should be issued to each of theses registered phrase'), Miyazawa does not expressly disclose 
"an event data storage section storing the event data". However, this feature is well known in 
the art as evidenced by Petkovic who discloses system and method for automatic audio content 
analysis for word spotting, indexing, classification and retrieval (title), comprising a data storage 
device having computer usable code means, including using predetermined audio features being 
reprehensive of respective audio events (event data, which is necessarily stored in the data 
storage), and using a speech recognition engine to render words (interpreted as event data since 
they are directly associated to the audio events) and alternatives to some words (column 17, lines 
23-56). Therefore, it would have been obvious to one of ordinary skill in the art at time the 
invention was made to modify Miyazawa by specifically providing a data storage for storing 
event data relating the audio processing, as taught by Petkovic, for the purpose of recognizing 
many audio applications benefiting from a more generalized ability, based on predetermined 
audio events (Petkovic: column 2, line 67 to column 3, line 4). 

Miyazawa in view of Petkovic does not expressly disclose a mechanism "transmitting 
said input symbol string to the other interactive system as event data, and receiving event data 
consisting of the symbol string from said other interactive system". However, this feature is well 
known in the art as evidenced by Kanevsky who disclose that an automatic dialog system 
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includes a speech recognition module and a speech generation module (abstract), comprising an 
external service provider adapter 122 for facilitating communications between the dialog car 
system and a remotely located external service provider 123, the external provider receives alert 
signals (necessarily includes some symbol string) and the actual content of conversation will be 
transferred from the external service provider 123 to the conversational planner 107 (column 5, 
lines 22-64 and Fig. 1). Therefore, it would have been obvious to one of ordinary skill in the art 
at time the invention was made to modify Miyazawa in view of Petkovic by specifically 
providing a mechanism for transmitting processed data (symbol string) from one interactive 
system to the other, as taught by Kanevsky, for the purpose of updating the appropriate modules 
with new data (Kanevsky: column 5, lines 40-51). 

As per claim 12 (depending on claim 8), Petkovic further discloses audio source, such as 
a broadcast network, or radio station, speech recognition engine (necessarily converting speech 
(voice) into text (symbol strings) (column 6, lines 48-67), and a data storage device having 
computer usable code means for classifying and indexing (herein inherently include storing) 
audio signals representing audio events, including using a speech recognition engine to render 
words (symbol strings) and alternatives to some words (column 17, lines 23-56), which 
corresponds to the claimed "a second recognition processing section recognizing broadcast voice 
from a broadcast station, and converting the recognized broadcast voice into second input 
symbol strings" and "an event data storage section storing said second input symbol strings as 
event data". Therefore, it would have been obvious to one of ordinary skill in the art at time the 
invention was made to modify the combined system as stated above by specifically providing a 
speech recognition mechanism for broadcast audio source and a data storage for storing event 
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data related to the audio processing, including speech recognition, as taught by Petkovic, for the 
purpose of recognizing many audio applications benefiting from a more generalized ability, 
based on predetermined audio events (Petkovic: column 2, line 67 to column 3, line 4). 

As per claim 20, it recites a method. The rejection is based on the same reason as 
described for claim 8, because the claim recites same or similar limitation(s) as claim 8. 

As per claim 24 (depending on claim 20), the rejection is based on the same reason as 
described for claim 12, because the claim recites same or similar limitation(s) as claim 12. 

9. Claims 10 and 22 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Miyazawa in view of Petkovic and Kanevsky as applied to claim 8 above, and further in view of 
Wang. 

As per claim 10 (depending on claim 8), the rejection is based on the same reason as 
described for claim 6, because the claim recites same or similar limitation(s) as claim 6. 

As per claim 22 (depending on claim 20), the rejection is based on the same reason as 
described for claim 10, because the claim recites same or similar limitation(s) as claim 10. 

10. Claims 1 1 and 23 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Miyazawa in view of Petkovic and Kanevsky as applied to claim 8 above, and further in view of 
Marx. 

As per claim 11 (depending on claim 8), the rejection is based on the same reason as 
described for claim 7, because the claim recites same or similar limitation(s) as claim 7. 
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As per claim 23 (depending on claim 20), the rejection is based on the same reason as 
described for claim 11, because the claim recites same or similar limitation(s) as claim 1 1 . 



Allowable Subject Matter 
1 1 . Claims 2, 9, 14 and 21 are objected to as being dependent upon a rejected base claim, but 
would be allowable if rewritten in independent form including all of the limitations of the base 
claim and any intervening claims. 

The following is an examiner's statement of reasons for allowance: 

Regarding claim 2 (depending on claim 1), the prior art of record fail to specifically 
disclose or fairly suggest the interactive speech recognition and speech synthesis system with 
distinct combined features that comprises: 

a conversation characteristic control section grasping conversation characteristics of said 
user based on said input symbol string from said conversation pattern processing section, and 
changing said output symbol string in accordance with the grasped conversation characteristics 
(inherited from claim 1); and 

a conversation characteristic control section that optimizes a conversation topic, a 
conversation pattern, uttered words, a conversation response interval utterance frequency and 
timing during the utterance based on words and a conversation pattern used by the user with high 
frequency, a specific topic, times of conversation responses and utterance time and the number 
of words uttered by the user . 
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Regarding claim 9 (depending on claim 8), the prior art of record fail to specifically 
disclose or fairly suggest the interactive speech recognition and speech synthesis system with 
distinct combined features that comprises: 

an event data transmission and reception section transmitting said input symbol string to 
the other interactive system as event data, and receiving event data consisting of the symbol 
string from said other interactive system; an event data storage section storing the event data 
received by the event data transmission and reception section from said other interactive system, 
(inherited from claim 8); and 

an input section inputting a privacy protection command of said user; and a privacy 
protecting function section transmitting said event data acquired by utterance of said user to said 
other interactive system if said privacy protection command is not inputted from the input 
section, and not transmitting said event data acquired by the utterance of said user to said other 
interactive system if said privacy protection command is inputted from the input section . 

Regarding claim 14 (depending on claim 13), the allowance is based on the same reason 
as described for claim 2, because the claim recites same or similar limitation(s) as claim 2. 

Regarding claim 21 (depending on claim 20), the allowance is based on the same reason 
as described claim for 9, because the claim recites same or similar limitation(s) as claim 9. 
12. The prior art, Miyazawa et al. (US 5,946,658), Wang et al. (US 6,505,162 Bl), Marx et 
al. (US 6,173,266 Bl), Petkovic (US 6,185,527 Bl) and Kanevsky et al. (US 6,236,968 Bl), of 
record provided numerous teachings or alternative approach for speech and speaker recognition 
with noise, including generating background noise models, spectral domain analysis, measuring 
the average (mean) and variance of a signal, estimating noise statistical properties, selective 
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addition of noise, adding and subtracting mean, modifying noise speech. However, the features 
as presented above are not anticipated by, nor made obvious by the prior art of the record. 

Any comments considered necessary by applicant must be submitted no later than the 
payment of the issue fee and, to avoid processing delays, should preferably accompany the issue 
fee. Such submissions should be clearly labeled "Comments on Statement of Reasons for 
Allowance." 



1 3 . Any response to this action should be mailed to: 

Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 
or faxed to: 

(703) 872-9306, (for formal communications intended for entry) 

Or: 



Patent Correspondence delivered by hand or delivery services, other than the USPS, 
should be addressed as follows and brought to U.S. Patent and Trademark Office, 220 20th 
Street S., Customer Window, Crystal Plaza Two, Lobby, Room 1B03, Arlington, VA, 22202 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Qi Han whose telephone numbers is (703) 305-563 1 . The 
examiner can normally be reached on Monday through Thursday from 9:00 a.m. to 7:00 p.m. If 
attempts to reach the examiner by telephone are unsuccessful, the examiner's supervisor, 
Richemond Dorvil, can be reached on (703) 305-6954. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Inquiries regarding the status of submissions 
relating to an application or questions on the Private PAIR system should be directed to the 
Electronic Business Center (EBC) at 866-217-9197 (toll-free) or 703-305-3028 between the 
hours of 6 a.m. and midnight Monday through Friday EST, or by e-mail at: ebc@uspto.gov. For 
general information about the PAIR system, see http://pair-direct.uspto.gov. 



Conclusion 



(703) 872-9306, (for informal or draft communications, and please label 
"PROPOSED" or "DRAFT") 



QH/qh 

September 13,2004 




